Change Points via Probabilistically Pruned Objectives

Authors

  • Nicholas A. James
  • David S. Matteson
Abstract

The concept of homogeneity plays a critical role in statistics, both in its applications and in its theory. Change point analysis is a statistical tool that aims to attain homogeneity within time series data. This is accomplished by partitioning the time series into a number of contiguous homogeneous segments. The applications of such techniques range from identifying chromosome alterations to solar flare detection. In this manuscript we present a general-purpose search algorithm called cp3o that can be used to identify change points in multivariate time series. This new search procedure can be applied with a large class of goodness-of-fit measures. Additionally, the computational time needed to identify change points is reduced by means of probabilistic pruning. With mild assumptions about the goodness-of-fit measure, this new search algorithm is shown to generate consistent estimates for both the number of change points and their locations, even when the number of change points increases with the time series length. A change point algorithm that incorporates the cp3o search algorithm and E-Statistics, e-cp3o, is also presented. The only distributional assumption that the e-cp3o procedure makes is that the absolute αth moment exists, for some α ∈ (0, 2). Due to this mild restriction, the e-cp3o procedure can be applied to a majority of change point problems. Furthermore, even with such a mild restriction, the e-cp3o procedure has the ability to detect any type of distributional change within a time series. Simulation studies are used to compare the e-cp3o procedure to other parametric and nonparametric change point procedures, and we highlight applications of e-cp3o to climate and financial datasets.
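
As a rough illustration of the kind of goodness-of-fit measure the abstract refers to, the sketch below computes the standard sample energy distance between two segments with exponent α ∈ (0, 2) and uses it to score every possible single split of a short series. This is not the cp3o or e-cp3o procedure: it is a brute-force search for one change point with no pruning. The function names, the size-weighted score, and the `min_size` parameter are assumptions made for illustration only.

```python
import math
from itertools import product


def mean_pairwise_distance(xs, ys, alpha):
    """Average of ||x - y||^alpha over all pairs (x, y), Euclidean norm."""
    total = sum(math.dist(x, y) ** alpha for x, y in product(xs, ys))
    return total / (len(xs) * len(ys))


def energy_divergence(xs, ys, alpha=1.0):
    """Sample energy distance between two segments (V-statistic form)."""
    if not 0 < alpha < 2:
        raise ValueError("alpha must lie in (0, 2)")
    between = mean_pairwise_distance(xs, ys, alpha)
    within_x = mean_pairwise_distance(xs, xs, alpha)
    within_y = mean_pairwise_distance(ys, ys, alpha)
    return 2.0 * between - within_x - within_y


def best_single_split(series, alpha=1.0, min_size=2):
    """Score every admissible split exhaustively; return (index, score).

    Illustrative only: a hypothetical helper, not the pruned search
    described in the abstract.
    """
    n = len(series)
    best = None
    for t in range(min_size, n - min_size + 1):
        left, right = series[:t], series[t:]
        # Weight the divergence by segment sizes, as in the usual
        # two-sample energy test statistic.
        weight = len(left) * len(right) / n
        score = weight * energy_divergence(left, right, alpha)
        if best is None or score > best[1]:
            best = (t, score)
    return best


# Observations are tuples so the same code handles multivariate series.
series = [(0.0,)] * 5 + [(10.0,)] * 5
split_index, split_score = best_single_split(series)
```

In this toy series the highest score is obtained at the split between the fifth and sixth observations. The exhaustive loop costs quadratic work per candidate split, which is the kind of expense that a pruned search aims to avoid.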

Similar resources

Estimation of Rényi Information Divergence via Pruned Minimal Spanning Trees

In this paper we develop robust estimators of the Rényi information divergence (I-divergence) given a reference distribution and a random sample from an unknown distribution. Estimation is performed by constructing a minimal spanning tree (MST) passing through the random sample points and applying a change of measure which flattens the reference distribution. In a mixture model where the refere...

A Bayesian Approach to Concept Drift

To cope with concept drift, we placed a probability distribution over the location of the most-recent drift point. We used Bayesian model comparison to update this distribution from the predictions of models trained on blocks of consecutive observations and pruned potential drift points with low probability. We compare our approach to a non-probabilistic method for drift and a probabilistic met...

Probabilistic Evaluation of Candidates and Symptom Clustering for Multidisorder Diagnosis

This paper derives a set of formulas for computing the probability of a symptom clustering, given a set of positive and negative findings. Symptom clusterings are produced by a recent method for multidisorder diagnosis that efficiently finds minimal candidates, or disorder combinations, to explain a given set of positive findings. Each symptom clustering represents a collection of such candidat...

Multi-Objective Probabilistically Constrained Programming with Variable Risk: New Models and Applications

We consider a class of multi-objective probabilistically constrained problems MOPCP with a joint chance constraint, a multi-row random technology matrix, and a risk parameter (i.e., the reliability level) defined as a decision variable. We propose a Boolean modeling framework and derive a series of new equivalent mixed-integer programming formulations. We demonstrate the computational efficienc...

Probabilistic optimization via approximate p-efficient points and bundle methods

For problems in which decisions are taken prior to observing the realization of underlying random events, probabilistic constraints are an important modelling tool if reliability is a concern. A key concept for dealing numerically with probabilistic constraints is that of p-efficient points. By adopting a dual point of view, we develop a solution framework that includes and extends various existing f...

Publication date: 2015